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Abstract 



■ We consider an optimal control problem governed by an ODE with 

, memory playing the role of a control. We show the existence of an op- 

timal solution and derive some necessary optimality conditions. Some 
examples are then discussed. 
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1 Introduction 

o . 

^ ■ The present paper deals with the optimal control of equations of the form 



x{t) = (/(., ut) = [ f{s, x{s)) dutis), t e [0, 1], a;(0) = Xq, (1.1) 

where the control is a family of probability measures t ^ ut such that ut is 
supported by [0,t] (nonanticipativity). 

The special case where = 6g(^t) is a Dirac mass corresponds to the 
deviated equation 

x{t) = f{9{t),xm)), x{0)=xo, (1.2) 
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where the control is the deviated function 9 satisfying 9{t) < t for all t G [0, 1]. 
We shall see that the optimal control of fll.ip is the natural relaxation of that 
of the deviated equation (11.21) . 

We will consider the minimization of the functional 

]{t,x{t))dt + h{x{l)) + (^j^ g{t,s)dut{s)^dt (1.3) 

where x is related to u by the state equation (11.11) . An example of relevant 
function g is g{t, s) = X\t — with p > 1 and A > 0. In this case the last 
term in the previous functional is A W^{6t, i^t) dt where Wp{6t,i't) is the 
p-Wasserstein distance between and the Dirac measure at t 

WP{6t,ut)= [ \t-s\Pdut{s). 

The interpretation of the Wasserstein term in the functional is therefore a 
penalization of long-term memory. 

Dynamics with lags as (11.21) or with more general memory structure as 
(11.11) arise in many different settings in engineering, economics, biology, mod- 
elling of financial time series... It is typically the case when studying the 
optimal performances of a system in which the response to a given input oc- 
curs not instantaneously but only after a certain elapse of time. We refer for 
instance to the classical book of Bellman and Cooke [2] for a general overview 
of such functional equations. A natural motivation for the optimal control 
of the delayed equation (II. 2p where the deviation 9 is the control is then as 
follows. Imagine some natural or industrial process is known to obey the 
delayed equation (II. 2p but with an unknown delay function 9 (think of the 
incubation period of some disease). If now some-possibly noisy-observation 
xq is available then the simplest way to estimate 9 is by least squares, that 
is by minimizing 

1 

\x{t) -XQ{t)\^dt 

with respect to 6*, x being linked to 9 by (11.20 . As already mentioned, this 
problem needs to be relaxed in a suitable way, which, roughly speaking, 
amounts to replace (II. 2p by (II. ip (see section |3]for a precise statement). An- 
other motivation for related problems in economics can be found for instance 
in ig. 

Now we claim that considering directly (i.e. without involving relaxation) 
the optimal control of (II. ip by vt is natural as well. Indeed, it is customary 
in time-series analysis to consider autoregressive dynamics. In discrete time. 
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an autoregressive process is a stochastic process that satisfies a relation of 
the form: 

K 

Xt = ^ akf{t - k, Xt_k) + St 

k=0 

where et are independent and identically distributed random variables. Such 
processes thus have some memory (the range of the memory being the integer 
K, called the order of the process), and the fact that the previous dynamics 
is given by a convolution captures some stationarity of the memory structure. 
This is a particular case of 

Xt = Y,Ms)f{s,X,)+et. 

s<k 

Formally passing to continuous time in the previous equation strongly sug- 
gests that estimating the memory structure of the process by least squares 
naturally leads to solve a problem of the form f ll.ip -( lL3|) . 

Let us insist here on the fact that the optimal control problem f ll.ip - fll.3p 
in which the memory structure is unknown and acts as a control is, as far 
as we know, somehow unusual. For variational or optimal control problems 
where a deviation or a memory structure is given and one looks for a classical 
optimal control, we refer to [TT], [5] (necessary optimality conditions) or 
[6] (dynamic programming approach leading to an Hamilton- Jacobi equation 
in infinite dimensions). 

The paper is organized as follows. Section H] is devoted to some prelimi- 
nary results. In section |3l we prove existence of solutions for (11. Sp and show 
that (II. 3p is the natural relaxation of an optimization problem posed over 
deviation functions. In section |U we establish optimality conditions. Finally 
section [S] is devoted to some examples. 

2 Preliminaries 

Let (t't)tG[o,i] be a Borel family of probability measures (meaning that t i— t- 
J g dvt is Borel for every continuous g) such that t'f ([0, t]) = 1 for every t. Let 
/ G C°([0, 1] X ]R'^,R°') satisfy the Lipschitz condition that there is a > 
such that: 

|/(t, x) - fit, y) \<k\x~ y\ Vt G [0, 1], V(x, y) e^" x W". (2.1) 

The next result gives the existence and uniqueness of a solution to the 
state equation (II. ip : 
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Proposition 2.1. Let f E C^{[0, 1] xM'^, R"') satisfy the Lipschitz assumption 
{i^t)te[o,i] above and Xq G M'^. Then the Cauchy problem 

x{t) = (/(., lyt) , Vt G [0, 1], a;(0) = Xo (2.2) 

admits a unique continuous solution x. Moreover x is Lipschitz continuous 
and satisfies \\ < M for a constant M that only depends on k, \xq\ 

and supig[o_i] |/(t,0)|. 

Proof. We proceed as in pj. Let X > k and equip C°([0, 1], M*^) with the 
norm 

||s||a := sup e^^*|x(t)|. 
te[o,i] 

For every x G C°([0, 1],R°'), define the continuous function Tx by 

Tx{t):=Xo+ [ {f{.,x{.)),u,) ds VtG[0,l]. 
Jo 

For X and y continuous and t G [0, 1], one deduces from f l2.ip and the nonan- 
ticipativity of {i^t)t- 

\Tx{t)-Ty(t)\<k\\x-y\\x ^ e^^ dus{r) ds <^\\x - y\\xe^' . 

One then deduces that T is a contraction and therefore has a unique fixed 
point in C". The second claim easily follows. □ 

In what follows, denotes the Lebesgue measure on [0, 1]. Let {ut)t be 
admissible for our problem which means that it is a Borel family of probability 
measures such that z/t([0, t]) = 1 for every t. It will be convenient in the sequel 
to define 7 := i/^ £^ that is the probability measure on [0, 1]^ defined by 

/ if{t,s)d^{t,s)= [ ([ if{t,s)dut{s))dt (2.3) 
^[0,l]2 Jo ^Jo ' 

for every G C°([0, 1]^, M). The admissibility of (z/t)t is equivalent to require 
that ^ := Vt ® £} belongs to the set 

r := {7 probability on [0, 1]^ : 7(T) = 1, 7ri#7 = C}}. (2.4) 

where T is the triangle T := {(t, s) G [0, 1]^ : s < t\ and vri^7 denotes the 
first marginal of 7. Given 7 G F, the disintegration Theorem (see [H]) enables 
one to disintegrate 7 as 7 = vt®C} with {yt)t admissible. In the sequel, under 
the assumptions of proposition 12. 11 the solution of the Cauchy problem (12. 2|) 
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will be denoted x^. Let us also remark that be characterized by the 

weak form of fl2.2p that can be conveniently written in terms of 7 as: 

(p{t) ■ x^{t) dt = — I (p{t) ■ f{s, Xj{s)) d'y{t, s), Xy{0) = xo (2.5) 

J[0,l]2 

for every if G C^((0, 1), M'^). The other obvious advantage of formulating the 
problem in terms of 7 G F is that F is weakly* compact. 

In the sequel, we shall always assume that / G C°([0, 1] x R'',R'^) satisfy 
the Lipschitz assumption fl2.ll) so that x^ is well-defined for every 7 G F. 

Lemma 2.2. // (7^) G F^ weakly* converges to 7, then x^^ converges to x^ 
m C^{[0,1],R'^). 

Proof. Setting Xn '■= x^„, it follows from proposition 12.11 and Ascoli-Arzela's 
theorem that (x„) is precompact in C^. Let y be the uniform limit of some 
(not relabeled) subsequence. Let ip G C^((0, 1),M'^), one has for every n 

ip{t) ■ Xn{t) dt = - ip{t) ■ f{s,Xn{s))d'Jn{t,s), Xn{0) = Xq. (2.6) 

^[0,1] 2 

Since Xn converges uniformly to y, and 7„ converges weakly* to 7, passing 
to the limit in (12. 6p one deduces from the continuity of / that y = x^, and 
by a standard compactness argument we deduce that the whole sequence 
converges to x^. □ 



3 Existence of optimal controls and relaxation 

We are now interested in the optimization problem 



infj(7) (3.1) 

7er 



where 



^(7) := / j{t,x^{t))dt + h{x.,{l))+ [ g{t,s) d-f{t,s). 
Jo Jt 

Theorem 3.1. We make the following assumptions: 

• the function j : [0, 1] x M'^ — )■ [0, +00] is Borel measurable, nonnegative 
and j{t, ■) is lower semicontinuous for a.e. t G [0, 1]; 

• the function : — )■ [0, +00] is nonnegative and lower semicontinu- 
ous; 
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• the function g : T ^ [0, +00] is nonnegative and lower semicontinuous 
on T; 

• there exists 70 G F such that J(7o) < +00. 

Then the optimal control problem f l3.ip has a finite value and admits a solu- 
tion. 

Proof. Thanks to lemma [521 one easily deduces from the assumptions above 
that J is weakly* lower-semicontinuous (for the lower semicontinuity of the 
last term express g as the supremum of continuous functions) and the exis- 
tence claim then follows from the weak* compactness of F. □ 

Let us define the set of admissible deviations : 

e := : [0, 1] ^ [0, 1] Borel, e{t) < t, for a.e. t G [0, 1]} 
and the optimal deviation problem 

inf F(^), with F{e) := J{6e ® C^) (3.2) 

that is the restriction of (13. ip to deviation (or delay) functions. Then (13. ip 
is the relaxation of (13. 2p in the following sense. 

Theorem 3.2. // j G C°([0, 1] x R'^), h G ^^(M^) and g G C^T), then for 
every 7 G F, there exists a sequence On in 6 such that F{6n) converges to 
7(7). In particular 

min 7(7) = inf F{9). 

7er See 

Proof. It follows from lemma 12.21 that J is weakly* continuous. Fix now 
7 G F; it is a well-known result in the theory of Young measures (see for 
instance Theorem 9.3 in [Ij) that there exists a sequence of Borel maps 
cr„ : [0, 1] I— 7- [0, 1] such that ® converges weakly* to 7. Defining 
Onit) := min{cT„(t), t} so that 9n G 6, it is obviously enough to prove that 
69^ ® converges weakly* to 7 to conclude. Let ip G C°([0, 1]^) and let us 
denote by u^, the modulus of continuity of ip. For every 5 > 0, we have 

1 

ip{t,en{t)) -ip{t,an{t))dt < 2||(^||ooi:'({t : an{t)>t + 6})+u^{5). 



From the weak* convergence of ® £^ to 7 and from the fact that the 
support of 7 is included in the triangle T, it is easy to deduce that for every 
6 > 0, C^{{t : cr„(t) > t + 6}) tends to 0. The weak* convergence of 60^ 
to 7 then follows directly. □ 
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It is easy to build from the previous result examples where there is no 
optimal delay function for f l3.2p . Indeed, let us take d = 1, /(x) = x, xq = 1, 
/i = 0, j{t,x) = withx*{t) = -l + 2e*/2 and g{t, s) = s(t-s). By 

construction x* = x^* for 7* = {So + St)/2 (g) £^ so that J(7*) = and 7* is 
optimal. If 6 was an optimal delay for fl3.2p then one should have F{6) = 
so that 9(t) G {0,t} a.e. and xq = x* (where slightly abusing notations xq 
denotes xsg(^c^). From the state equation one should then also have 

e*/2 = = -l + 2e^W/2^ a.e. 

which contradicts 9(t) G {0,t} a.e.. 

4 Optimality conditions 
4.1 A maximum principle 

In this section, we look for optimality conditions for fl3.ip . In what follows, 
we further assume that j, h and g are continuous and that 

• /i is of class C^, 

• j{t, .) and fit, .) are differentiable for every t G [0, 1] and V^j and D^f 
are continuous on [0, 1] x R'^. 

Let 7 = z/t ® ^-"^ be a solution to (13. ip . i] = fit ® E T and e G (0, 1); then 

i[J(7 + £(r/-7))-^(7)]>0. (4.1) 

As usual, our aim is to pass to the limit as £ — ?■ 0+ and to express the opti- 
mality conditions obtained this way in the form of some maximum principle, 
which will be achieved by introducing some suitable adjoint variable. To 
shorten notation, we set 7s = 7 + e{r] — 7), x = x-^ and 

Lemma 4.1. As e ^ 0^ , := 6^^{x^ — x) converges uniformly on [0, 1] to 
z, the solution of the linearized system: 

z(t) = {Az, ut) + {a, fit - i^t) , te[0,l], z{0) = (4.2) 

where A and a are the continuous functions: 

A{s):=DJ{s,x{s)), a{s):=f{s,x{s)), VsG[0,1]. (4.3) 
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Proof. First, let us remark that by the same arguments used in proposi- 
tion fl2.ip . the equation f l4.2p posseses a unique (Lipschitz) solution z. By 
construction we have -Ze(O) = and 

Ze{t) = ^^[/(., X + eze) - f{.,x))lu^ + (/(, X,), /ii - ut) (4.4) 

from which, by the Lipschitz assumption on /, we easily deduce that z^, is 
bounded in W^'^ (in particular || = 0{e)). By Ascoli-Arzela's 

theorem, Ze then possesses a cluster point y m. . Passing to the limit in 
(14.41) along a convergent subsequence then easily yields y = z and thus by 
compactness the whole family (-Ze)^ converges to z as e — O"*". □ 



Under our differentiability assumption, using lemma 14.11 and passing to 
the limit in (14. ip we then get: 



B ■ zdt + b- z{l) + {g{t,.),fit-iyt) dt>0. (4.5) 
Jo 

where z is related to 7 and 1] by the linearized equation (14.21) and 

B{s) := V.j{s,x{s)) Vs G [0, 1], b := Vh{x{l)). (4.6) 

To make condition (14.50 tractable, we shall introduce, as usual in control 
theory, an adjoint state; to do so we shall need a few notations and prelimi- 
naries. Recall that 7 G F is given by the disintegration 7 = ® jC^ and let 
u := T^2#l be the second marginal of 7 defined by 

^{s)dv{s)= f ^{s)dj{t,s)= [ {y^,ut)dt, V^gC°([0,1]). 

J[0,l]'2 Jo 

Invoking again the disintegration theorem, 7 admits the disintegration 7 = 
u ^ u* that is u* is a Borel family of probability measures and for every 
test-function <^ G C°([0, 1]^) one has 

^dj= [ {^(.,3),^:) du{s)= [ {^{t,.),ut) dt. (4.7) 
[0,1]2 Jo Jo 

Note that the requirement that 7 is supported by T implies that z/*([s, 1]) = 1. 
Since 7 has £^ as first marginal, (14. 7p also holds for test functions of the form 
(t, s) I— >■ q(t)(f{s) with (f continuous but q only L^. This enables us, for q E L^, 
to define {q^^D u as the finite measure defined by 

(y., (g, vl) v) := [ q(t) {if, u^) dt = [ q{tMs) d^(t, s) G C°([0, 1]). 



[0,1] 
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The adjoint of (14.21) will then be expressed using the measures u* and u, as 
usual, it is an equation of backward type and reads as: 

q = B-A^{q,u:)iy, g(l) = -6 (4.8) 

where the continuous functions B and A are defined by f l4.6p and fl4.3p re- 
spectively, A'^ denotes the transpose of A and the measure (g, u*) u is defined 
as above. A solution of f l4.8p is then by definition an i^^([0, 1],R'^) function 
q such that 

q(t) = -b- [ Bds+ [ A^(s) (g, z/,*) du{s) for a.e. t G [0, 1]. (4.9) 
Jt J[t,i] 

Such a solution is of course BV and the weak formulation of 04.81) reads as 
q-ipdt = -[ B-(pdt-b-(p{l)+ [ (A{s)ip{s) -qit)) d-f{t,s) (A.IO) 



for every G C^([0, 1]), M*^) such that, ip{0) = 0. Note that it is easy to see in 
this case that fl4.10p also holds for every test-function, ip e W^'°°{[0, 1],R'^) 
such that ip{0) = 0. The well-posedness of (14. 8 p follows from the result below. 

Lemma 4.2. Let B e C°([0, 1], R'^), A G C%[0, 1], M'^^'^) and 6 G M"'. Then 
Iji4-^ admits a unique solution q G BV. Moreover if z is the solution of Iji4-^ 
then ^ ^ 

/ B ■ zdt + b- z{l) = - q{t){a,fit-i^t) dt. (4.11) 
Jo Jo 

Proof. Take A > ||A||oo and equip -^^^((0, 1),R"') with the norm 

1 

e^*|g(t)| dt. 

In view of (14.91) . it is natural, for every q E to define Tq by 

Tq{t) ■=-b- [ Bds+ [ A^(s) (g, u^) dv{s) for a.e. t G [0, 1], 

Jt J[t,i] 

so that Tq G BV. Let qi and q2 be in and t G (0, 1), let e G (0, t) and ipe 
be a continuous function such that < (/^^ < 1, v^e = 1 on [t, 1] and = 
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on [0, t — e]. We then have: 



\Tqi{t)-Tq2{t)\<\\A\\^ / y,, {\qi - q2\,u:) du 







Moo {ipe,^r)\qi{r) - q2{r)\dT 



<\\A\\oo / z^r([t-£,r])|gi(r) -g2(r)|rfr 





< ||v4||oo / \qi - q2\ dr. 

Jt-e 



Lettine; e — )■ 0+ we get 



\Tqi{t) - Tq2{t)\ < \\A\\^ ^ |gi - gal dr. (4.12) 
Multiplying (14.121) by e'^*, integrating and using Fubini's theorem then yields 
llTgi-Tgall < e^*^ \q,-q2\dT^dt 

\qi{r) - q2{r)\{ f e^' dt] dr 







, ll^llOO II II 

< -^^m - 121 

Since we have chosen A > || A||oo) the map T is a contraction and the existence 
and uniqueness of a solution to (14. 8 p then follows from Banach-Picard's fixed 
point theorem. 

Let us now establish (14. lip . Using z (which is Lipschitz) as test-function 
in (I4.10p . we first get 



q.zdt = - B ■ zdt-b- z{l) + {A{s)z{s)) ■ q{t) d-f{t, s) 
Jo J[o,ip 

then using (14. 2 p yields 

q-zdt= / {A{s)z{s)) ■ q{t) d-f{t,s) + / q{t) ■ {a, /Jt - i^t) dt 
J[o,ip Jo 

which proves (14. lip . □ 
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Theorem 4.3. Under the assumptions of this section, if^y G F, 7 = Ut^C^ = 
V ® vl solves and x = then one has for C^-a.e. t 

spt(z/t) C argmax,g[o_4]{g(t) ■ f{s,x{s)) - g{t, s)}, (4.13) 

where q is the adjoint variable whose dynamics is given by 

q{l) = -Vh{xil)), qis) = V,j{s, x{s)) - DJ{s, x{s)f (g, u*,) z/(s). (4.14) 

Proof. Let •y = Ut <S) E T he optimal for (13. ip : then for every rj = fit^ C^, 
it foUows from (gSD, (gSD and fl4l4D that 

/ {q{t)- f{.,x{.))-g{t,.),ut~fit) dt>0 
Jo 

with q defined by (I4.14p . Since in the previous inequality, /it is an arbitrary 
probability measure supported on [0,t], (I4.13P directly follows. □ 

Remark 4.4. In the case where / is linear in x (that is f{s, x) = A{s)x) and 
j{t, .) and h are convex (so that J(7) is convex in x-y, but in general not in 
7), then the following condition is sufficient for optimality: 

spt{ut) C argmax,g[o^t]{g(t) • {A{s)x^{s)) -g{t,s)}, (4.15) 

for every rj E T and a.e. t G [0, 1]. Indeed, if 7 = z/t ® C^, x = x^, q is the 
adjoint variable defined as in theorem 14.31 and rj = fif^ ET,hj the above 
convexity assumptions and the same manipulations as before (integration by 
parts and disintegration), we get: 

Jijl)-Jin)> I Va^i{s,x{s)) ■ {xn{s) - x{s))ds + j g{t, s)d{r] - -f){t, s) 
Jo ^[o,ip 
1 

{q{t) ■ Axr^{.) - g{t, .), ut - /ii) dt 

which proves that (14.151) is a sufficient optimality condition. Of course, f l4.15p 
is not necessary in general and is difficult to exploit since it involves every 
admissible r]. However, we shall give a simple example in section |5] where the 
sufficient condition (I4.15P together with the necessary condition of theorem 
14.31 actually enables one to compute an optimal control. 

Remark 4.5. Let us indicate, without giving details, that one may obtain in 
a similar way as in theorem 14.31 optimality conditions for the optimal control 
of the slightly more general state equation containing a local term: 

x{t) = r]{t, x{t)) + (/(., x(.)), ut) , x{0) = xo. 
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Indeed, considering the minimization of the cost J (which has the same form 
as before), one gets, by similar arguments as previously, that any minimizer 
7 = z/t®£^ has to satisfy the statement of theorem 14. 3[ the only modification 
being that the adjoint equation now reads as 

q{s) = V,j{s,x{s)) - D,r]{s,x{s)fq{s) - DJ{s,x{s)f {q,u:)u{s). (4.16) 



4.2 Solutions with finitely many Dirac masses 

This paragraph is somehow independent from the previous one on optimal- 
ity conditons. A simple application of Caratheodory's theorem, gives the 
existence of optimal controls involving only finitely many Dirac masses. 



Proposition 4.6. Assume that f satisfies h2. 1\) and f and g are continuous, 
then for every j = Ut^C^ G F there exists rj = fit^C^ such that fit has finite 
support with at most cardinality d + 2 such that x^ = Xr, and 

1 .1 

g{t, s) diJt{s) = I g(t, s) dfit{s) a.e. t. 
Jo 

In particular, in the optimization problem liS. 1\) it is enough to optimize over 
measures in T of the form Vt ® with Vt cl convex combination of at most 
d + 2 (d + 1 if g = 0) Dirac masses. 

Proof. Let 7 G F, x := a;^ and define for every t G [0, 1] the compact set 

S,,t:={ifis,xi.s)),git,.s)) : s G [0, t]} C 
By construction, for almost every t one has 



(x{t), g{t,s)dut{s)^ eco{S^^t)- 



It thus follows from Caratheodory's theorem (see [TU]) that (x(t), Jq g{t, s) dut{s 
may be expressed as a convex combination of at most d + 2 points in Sx,t- 
Hence there exists a discrete probability fit on [0,t] with at most d + 2 points 
in its support such that 

i{t) = {f{-,x{-)),fJ't), / g{t,s)dut{s) = g{t,s)dfit{s). 

Jo Jo 

The fact that the discrete measure fit can be chosen measurable in t follows 
from standard measurable selection arguments (see for instance [7]). □ 
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5 Examples 
5.1 Scalar case 

We start by considering the scalar ODE with memory 

X = a (x, vt) , x{0) = 1 

where a > is a parameter and ut is an admissible control. It is clear that 
if we want to minimize a cost like 

J{l) = [ x^{t) dt 
Jo 

the best choice for the control is 7 = z/j ® £^ with z/j = 60 for all t G [0, 1], 
which gives 

a 

X{t) = l + at, Jmin = ^+2' 

Analogously, for a cost like 

J(7) = -x,(l) 

the best choice is 7 = z/^ (g) £^ with Uf = 6t for all t G [0, 1], which gives 

x(t^ 6 ; Jmin 6 • 

Take now the cost ^ 

7(7) = a Xj{t) dt — bxj{l) 



with a,b > and with no penalization in the use of memory (i.e. g = 0). 
The Pontryagin principle of theorem 14.31 gives the adjoint equation 

qit)=a-a{q,u:)uit), q{l) = b (5.1) 

and the necessary condition of optimality 

spt(z/i) C argmax,g[o^t] {aq{t)x^{s)} (5.2) 

since x^ is increasing, the latter is equivalent to 



St ifg(t)>0 
So if q{t) < 0. 



Thanks to remark HTil and the fact that x,, is nondecreasing for every admissi- 
ble T] then the conditions fl5.ip - fl5.2l) are in fact sufficient. Since g(l) = 6 > 0, 
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we necessarily have Vt = for t cfose to 1. Now, it is natural, in view of the 
previous considerations to look for an optimal control in the form 



So ifte(0,to) 
6t if t e (to, 1) 



where to ^ [0, 1] will be determined in such a way that the previous necessary 
and sufficient optimality conditions hold. For such a family of Dirac masses, 
a direct computation yields 

= to^o + 

and 

{^0 ^^fo,to] if -s = and to 7^ 0, 
6o if s = and to = 0, 

5t ifse(to,l) 

(note that z/* is only defined for z/-a.e. s). Integrating (15. ip on (to, 1] then 
yields 

g(t) = -+(6--)e"(i-*), tG(to,l). 
a \ a/ 

There are now two cases: either 

6 e" - 1 , , 

- > 5.3 

a ~ ae" ^ ^ 

in which case, q remains positive on (0, 1) and thus to = and = 6t for 
every t G [0, 1] is optimal. Or f l5.3p does not hold; in this case, set 

to = l--ln(^-) G(0,1). 
a \a — abJ 

Now let t G (0,to), integrating (15. ip between t and to, we simply get q{t) = 
O'it — to) so that q is negative on (0,to) and then Ut = for t G (0,to) and 
Ut = 6t for t G (to, 1) is an optimal control. 

5.2 A two-dimensional example 

Let us now consider the two-dimensional linear-quadratic-like problem that 
consists in minimizing 

.1 ^ 

- {a{t)x\t) + b{t)y\t)) dt 

with respect to the admissible control (z/^) when the state equations for x 
and y are 

x{t) = a (x, ut) , y{t) = 13 (y, n) 
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together with the initial conditions {x{0),y{0)) = {xQ,yo). The weight func- 
tions a and b are continuous (but not necessarily positive). We also assume 
that a > 0, /3 > and xq > 0, yo < which guarantees that x is increasing 
and positive and y is decreasing and negative. The adjoint equations read as 

qi = ax -a (gi, u*) z/, q2 = by - a {q2, i^*) gi(l) = ^2(1) = 0. (5.4) 



Furthermore, the maximum principle of theorem 14.31 gives 

spt(z/t) C argmax,g[oi]{gi(t)x(s) +q2{t)y{s)]. 

Since x is increasing and y is decreasing, we thus deduce that: 

qiit) > 0, q2{t) <0^ut = 6t, 
qi{t) < 0, q2{t) >O^Ut = 6o. 

Now, if a(l) > and 6(1) > 0, it is easy to deduce from (15.41) and the fact 
that spt(z/;) C [s,l] (and that = 0) that qi{t) < and q2{t) > 

for t < 1 sufficiently close to 1 and therefore the optimal z/'s have to satisfy 
Uf = 60 for t close to 1. 

If a and b are everywhere positive, then (15. 4p implies gi < and ^2 > 
on [0, 1) so that (not surprisingly) there is only one optimal u that is Ut = 60 
for every t. Now let us consider the case where a and b may change sign and 
let us look for conditions that ensure that qi{t) > and q2it) < for t close 
to so that optimal z/'s satisfy ut = 6t for t close to 0. 

Let us assume further that a < 1 and /3 < 1, then we straightly deduce 
from the state equation the estimates 

||a;||oo < 7^^, Iblloo < (5.5) 
i — a 1 ~ P 

using those estimates and the adjoint equation for qi, we also get 

llgilloo < Mk^:^. (5.6) 

(1 — ct)^ 



We then get 



"9i(0) = / axdt — a (gi,z/*)(iz/ 
Jo Jo 

< / a+xdt— / a^xdt 



JO 



1 — a 



2 



^ ^0 1 / a^dt — a_dt + 



1 — a Jq Jq [1 — a 
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so that qi (t) < for t close to as soon as 



/ a-dt>- / a^dt+— —. 

Jo 1 - a Jo (1 - «) 

A similar condition on b ensures that 52 (i) > for i close to 0. 
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